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(57) Abstract 

Methods of designing or modifying pro- 
tein structure at the protein or genetic level to 
produce specified amino termini in vivo or in vit- 
ro. The m^ods can be used to alter the meta- 
bolic stability and other properties of the protein 
or, alternatively, to artificially generate authentic 
amino-tennini in proteins produced through ar- 
tificial means. The methods are based upon the 
introduction of the use of artificial ubiquitin- 
protein fusions, and the discovery that the in vivo 
half-life of a protein is a function of the amino- 
tenninal amino add of the protein. 
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METHODS OF REGULATING METABOLIC 
STABILITY OF PROTEINS 

Background of the Invention" 

In both bacterial and eukaryotic cells, rela- 
tively long-lived proteins, whose half -lives are 
close to or exceed the cell generation time, coexist 
with proteins whose half-lives can be less than one 
percent of the cell generation time. Rates of 
intracellular protein degradation are a function of 
the cell's physiological state, and appear to be 
controlled differentially for individual proteins. 
In particular, damaged and otherwise abnormal 
proteins are metabolically unstable in vivo. 
Although the specific functions of selective protein 
degradation are in most cases still unknown; it is 
clear that many regulatory proteins are extremely 
short-lived in vivo. Metabolic instability of such 
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proteins allows for rapid adjustment of their 
intracellular concentrations through regulated 
changes in rates of their synthesis or degradation. 
The few instamces in which the metabolic instability 
of an intracellular protein has been shown , to be 
essential for its function include the cll protein 
of bacteriophage lambda and the HO endonuclease of 
the yeast Saccharomyces cerevisiae. 

Most of the selective turnover of intracellular 
proteins xonder normal metabolic conditions is 
ATP^dependent and (in exikstryotes) nonlysosomal . 
Recent biochexoical and genetic evidence indicates 
that, in eukaryotes, covalent conjugation of xibi- 
quitin to short-lived intracellular proteins is 
essential for their selective degradation. The 
rules which determine whether a given protein is 
metabolically stable or unstable in vivo were 
previously unknown. 

SuTmnary of the Invention 

This invention pertains to methods of 
engineering the amino terminus of proteins thereby 
controlling the metsdDolic stability and other 
properties of a protein. Further , this invention 
provides a method for either in vivo or in vitro 
production of proteins with any of the twenty amino 
acid residues (or analogs thereof) at the protein's 
amino-terminus . The invention is based in part upon 
the striking discovery that the in vivo half-life of 
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an intracellular protein is a function of its 
amino-terminal amino acid residue and upon a novel 
(suad more generally appliccODle) technique that 
allows one to generate proteins with specified 
amino-termini in vivo or in vitro . The invention 
also pertains to a newly identified protease, 
ubiquitin-specif ic processing protease, which has 
properties. that allow one to expose, either in vitro 
or in vivo , any desired amino acid residue at the 
eunino-terminus of a protein of interest. 

The nature of the amino acid exposed at the 
amino-terminus of an intracellular protein was shown 
to be one crucial determinant that specifies whether 
a protein will be long- or short-lived in vivo. 
Individual amino acids can be categorized as either 
stabilizing or destabilizing amino acids with 
respect to the half-life that they confer upon a 
protein when exposed at the protein's amino- 
terminus. Destabilizing amino acid residues confer 
short half -lives, down to a few minutes for some of 
the destabilizing amino acids, stabilizing sunino 
acid residues confer long half-lives of memy hours. 
This striking and newly discovered* dependency of a 
protein's half -life on its amino-terminal residue is 
referred to herein as the N-end rule. 

Based upon the N-end rule, the amino-terminus 
of a protein can thus be designed or altered to 
change the intracellular half-life of the protein 
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and in this way the lifetime and/or activity of the 
protein in vivo can be regulated. This capability 
can be exploited for rational protein design in many 
different contexts. Natural or wild type proteins 
can be modified to render them more or less 
resistant to degradation in vivo . The design or 
alteration of the protein can be done at the protein 
level or at the genetic (DNA> level. For example, 
proteins can be modified by chemically altering or 
engineering the amiho-terminus to provide for 
exposxire at the amino-terminus of an amino acid 
residue of the stabilizing or destabilizing class. 
At the genetic level, .genes encoding proteins can be 
made to encode an amino acid of the desired class at 
the amino-terminus so that the expressed protein . 
exhibits a predetermined amino-terminal structure 
which renders it either metabolically stable or 
unstable with respect to the N-end rule pathway of 
proteolytic degradation. Furthermore, proteins can 
be expressed fused to a "maslcing'' protein sequence 
which masks the engineered amino-terminus so that 
when unmasked the protein will exhibit the desired 
stability or other properties that depend on the 
nature of the protein's amino-terminal residue, in 
such constructs, for example, the junction between 
the two protein sequences can be designed to be 
cleaved specifically, for instance, by an endopro- 
tease. Endoproteolytic cleavage of the fused 
sequence unmasks the 
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specifically engineered amino-temiinu^ of the' 
protein of interest and subjects the protein to 



degradation governed by the N-end rule. One spe- 
cific and new way to engineer the protein's amino- 
terminus is provided in this invention by the 
identification of \ibiquit in-specific processing 
protease and determination of its sxibstrate speci- 
ficity. Using this protease^ fusions of ubiquitin 
with other proteins can be specifically processed 
either in vitro or in vivo to generate proteins with 
desired eunino-terminal residues. 

A different, and also new way to specifically 
Engineer short-lived proteins is provided in this 
invention by the discovery that ubiquitin-protein 
fusions, such as \ibiquitin-Pro-^ -galactosidase, 
that cannot be efficiently deubiguitinated, are 
metabolically unstable. Thus, by attaching the 
amino-terminal ubiquitin moiety to a protein in a 
way that makes its removal either impossible or 
inefficient, one can desteibilize proteins by a 
distinct technicjue that is not directly based on the 
N-end rule. 

In addition, variant cells can be developed 
which contain putative mutations in the "N-end" 
degrading protease (s) which either conditionally or 
nonconditionally stop degrading short-lived pro- 
teins. These cells can be used to overproduce 
proteins that ordinarily would be short-lived within 
the cell. 
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Brief Description of the Figures 

Figure 1 shows the construction of ubiquitin- 
lacZ gene fusions. 

Figure 2 shows experiments in which the half- 
lives of engineered ^ -gaJ. proteins aure directly 
measured. 

Figure 3 shows the changing of amino acid 
residues at the ubiquitin-- ^gal junction (A) using 
the newly discovered properties of ubiquitin- 
specific processing protease and the eoaino acid 
sequence in the vicinity of the junction (B) . 

Figure 4 shows the presence of multiple ubi- 
quit in moieties in met2U30lically unstable ^-gal 
proteins • 

Figure 5 shows a series of y3-gal species 
containing ubiquitin in metabolically tinstable 
^-gal proteins. 

Figure 6 shows that both prokaryotic and 
eutauTfotic long-lived intracellxilar proteins have 
stabilizing amino acid residues at their amino 
termini whereas secreted proteins exhibit a com- 
plementary bias. 

Detailed Description of the Invention 

The elucidation of the K-end rule is described 
in detail below. Briefly, this rule governing 
protein degradation was revealed by examining the in 
vivo half -lives of the enzyme ^-galactosidase having 
various amino acid residues at its amino- terminus 
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and produced as a fusion protein with ubiquitin. 
When a chimeric gene encoding a iibiquitin- p -galac- 
tosidase fusion protein is expressed in the yeast S« 
cerevisiae, ubiquitin is cleaved off the nascent 
fusion protein, yielding a deubiquitinated-^-galac- 
tosidase ( ^gal) . With one exception, this clea- 
vage takes place regardless of the nature of the 
amino acid residue of ^gal at the ubiquitin-^ gal 
junction, thereby making it possible to expose 
selectively different residues at the suaino-termini 
of otherwise identical ^gal proteins. The gal 
proteins so designed exhibited strikingly different 
half-lives in vivo , ranging from more than 20 hours 
to less than 3 minutes, depending upon the nature of 
the amino acid at the amino-terminus of ^gal. 
Amino acids can be thus ordered according to the the 
half lives they confer on ^gal when present at its 
amino-terminus • For exaunple, the amino acids 
methionine, serine, alanine, threonine, valine, 
glycine and cysteine confer a half -life of more than 
20 hours. Phenylalanine, leucine, asparagine, emd 
lysine yield half lives of about three minutes. 
Arginine, the most destabilizing amino acid, confers 
a half life of about two minutes. (See Table 1 below 
for complete list of amino acids and the correspond- 
ing half -lives) 

Currently known amino- terminal residues in 
long-lived, noncompartmentalized intracellular 
proteins from both prokaryotes and eukaryotes belong 
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virtually exclusively to the stabilizing class of 
amino acids, exactly as predicted by the N-end rule. 
This result stongly implicates the N-end rule in the 
selective degradation of intracellular proteins in 
general. 

The appropriate amino-terminal amino acid 
appears to be an essential (though not necessarily a 
sufficient) requirement for the metiabolic stability 
of a noncompartmentalized, intracellular protein* 
Thus, in order for a protein to be relatively stable 
intracellularly, a stabilizing amino acid should be 
present at the amino-terminus. The presence of a 
destabilizing residue at the amino-terminus of a 
protein is often, though not always, sufficient for 
its metabolic destabilization in vivo. When such 
destabilization occurs to a relatively small extent, 
further analysis shows either an insufficient 
accessibility of the amino-terminus or a lack of 
"permissible" sequence environment in the vicinity 
of the amino-terminus such as a lack of segmental 
mobility in the protein ^s amino-terminal region. 
The presence of a stabilizing amino acid at the 
amino-terminus at least in some cases (for instance, 
as observed for ^-gal) will confer stabUity upon 
the protein. However, a stabilizing amino acid at 
the amino-terminus may not always confer a long 
half-life because other degradative pathways may be 
involved in. determining the ultimate fate of pro- 
tein. For example, endoproteolytic cleavages 
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(cleavages outside of terminal • regions of the 
protein) may result in exposure of a destabilizing 
amino acid at the amino-terminus of a resulting 
product of the cleavage which is then rapidly 
degraded via the N-end rule pathway. Appropriate 
circumstances for use of a stabilizing amino acid 
can be ascertained empirically. 

Although the N-end rule may be only one com- 
ponent (albeit a central one) of a more complex 
"half-life rule" which embraces other aspects of 
selective protein degradation in vivo, the N-end 
rule provides a rational, practicable approach for 
designing or changing protein structure in order to 
produce proteins which are more or less resistant to 
degradation by the N-end rule pathway than natural, 
unmodified protein. Proteins can be' designed or 
modified at the protein or gene level to provide a 
desired amino acid of either the stabilizing or 
destabilizing class at their amino-terminus. The 
ability to regulate the half-life of a protein will 
allow one to modulate the intracellular activity of 
the protein. 

A straightforward approach to modifying a 
protein to increase or decrease its metabolic 
stability is to directly engineer the amino-terminus 
of the protein at the protein level. To provide a 
desired amino-terminal amino acid, the amino- 
terminus of the protein of interest can be chemical- 
ly altered, for example, by adding an amino acid of 



• 
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the stabilizing or destabili2ing class to the 
amino-terminxis of a protein or polypeptide, em- 
ploying an appropriate chemistry. Thus, for ex- 
ample, an unstable protein can be rendered more 
stable by adding a stabilizing amino acid residue 
Ce.g. methionine serine, alanine, threonine, valine 
glycine or cysteine) to the amino-terminus of the 
protein. Conversely, a stable protein can be 
destabilized by adding a destabilizing amino acid to 
the amino-terminus. one distinct way to modify the 
amino-terminus of a protein would be to employ spe- 
cific enzymes, amino acid-protein ligases, which 
catalyze posttranslational addition of a single 
amino acid to the protein amino-terminus. Other 
methods for nongenetic alterations of the same type 
can readily be ascertained by those skilled in the 
art. 

In some proteins, the amino-terminal end is 
obscured as a result of the protein's conformation 
(i.e., its tertiary or quaternary structure). In 
these cases, more, extensive alteration of the 
amino-terminus may be necessary to make the protein 
subject to the W-end rule pathway. For example, 
where simple addition or replacement of the single 
amino-terminal residue is insufficient because of an 
inaccessible amino-terminus, several amino acids 
(including lysine, the site of ubiquitin joining to 
substrate proteins) may be added to the initial 
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amino-tenainus to increase the accessibility and/or 
segmental mobility of the engineered amino-terminus . 

.Modification or design of the amino-terminus of 
a protein can also be accomplished at the genetic 
level. Conventional techniques of site directed 
mutagenesis for addition or substitution of ap- 
propriate codons to the 5* end of an isolated or 
synthesized gene can be employed to provide a 
desired amino-terminal structure for the encoded 
protein • For example, so that the protein expressed 
has the desired amino acid at its cuaino-terminus the 
appropriate codon for a stabilizing cimino acid can 
be inserted or built into the amino-terminus of the 
protein-encoding sequence. 

At the same time, expressed proteins are often 
naturally modified within a cell after translation. 
This can include modification of the amino-terminus. 
For example, the eunino-terminus can be acted on by 
an cuninopeptidase which cleaves one or several amino 
acids from the euaino-terminus . Amino acids may also 
be added to the amino-terminus by post-translational 
processing. This invention provides a way to 
"by-pass" still undefined rules of eaaino- terminal 
protein processing to expose exactly and specifical- 
ly the desired eunino acid residues at the 2unino- 
terminus of a mature processed protein species. To 
minimize the impact of such posttranslational events 
on the ultimate structure of the amino-terminus of a 
protein of interest, specific fusion proteins can be 
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designed wherein the amino-terminus of a protein of 
interest (designed to have the desired stabilizing 
or destabilizing structure) is preceded by a 
"masking" protein sequence fused to the amino- 
terminus. The fusion proteins are designed so that 
the protein sequence fused to the amino-terminus of 
the masking protein of interest is susceptible to 
specific cleavage at the junction between the two. 
Removal of the protein sequence thus unmasks the 
amiino-terminus of the protein of interest and the 
^^l^-li^te of the released protein is thus governed, 
by the predesigned amino-temainus. The fusion 
protein ,can be designed for specific cleavage in 
vivo, for example, by a host cell endoprotease or 
for specific cleavage in a in vitro system where it 
can be cleaved after isolation from a producer cell 
(which lacks the capability to cleave the fusion 
protein) . 

Ubiquitin is a broadly useful fusion partner 
for construction of a fused protein with a protein 
of interest: the discovery that artifical ubiquitin- 
protein fusions can be cleaved precisely by a 
cytoplasmic eukaryotic protease with little or no 
dependence on the protein to which ubiquitin is 
fused can be applied both in vivo and in vitro in 
protein engineering strategies, and is a major 
aspect of this invention. For example, the ubiquitin- 
protein fusion method can be used. to. airtifically 
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generate authentic amino-termini in proteins pro- 
duced through artifical means. Thus, amino-termini 
characteristic of natural eukaryotic or prokaryotic 
proteins can be generated by in vitro cleavage of 
ubiquitin-protein fusions produced in a prokaryotic 
host . 

A specific methodology for producing ubiquitin- 
^-galactosidase fusion proteins is described in 
detail below. Genes encoding any other proteins can 
be substituted for LacZ (the ^ -gal gene) in this 
me thodo 1 ogy • 

In general, ubiquitin fusion proteins are 
expressed by a chimeric gene construct comprising, 
in 5' to 3' orientation, a ubiquitin gene linked to 
a gene encoding the protein of interest. The codon 
for the amino-terminal amino acid of the protein of 
interest is located immediately adjacent the 3' end 
of the ubiquitin gene. The fused gene product is 
cleaved endoproteolytically either in vivo or in 
vitro (using either pure br partially purified 
ubiquitin-specific protease identified in the 
present invention) at the junction between ubiquitin 
and the protein of interest to generate the protein 
of interest having the desired eimino acid at its 
amino-terminus. There are a number of specific uses 
for the described ability to specifically engineer 
the protein's amino-terminus. One such use is 
established by the fact that the intracellular 
half-life of the released protein is governed by the 
principles of the N-end rule. Other applications of 
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the specific method for engineering the protein 
amino-teminus described herein range from adjusting 
the desired functional properties of a protein of 
interest, to modulating its antigenicity, and again, 
to other uses that can readily be aiscertained by 
those skilled in the airt. 

This method of. generating the desired amino 
acid residue at the amino-terminus of a protein of 
interest involves two novel components: one, the 
use of ubiquitin-protein fusions, and the other, the 
use of ubiquitin-specif ic processing protease that 
has been identified, and whose striking substrate 
requirements were discovered, in this work. Al- 
though the initial identification of the ubiqui tin- 
specific protease has been made in vivo, the enzyme 
is also relatively stable and active in vitro (in 
extracts), and can readily be purified to homogeneity 
by techniques known to those skilled in the art. 
Furthermore, the substrate specif ici^ of the ' 
ubiquitin-specific processing protease is highly 
conserved in evolution, being the same in yeast and 
mammals. The enzyme can be purified chromato- 
graphically from a crude extract by sequential 
chromatography on phosphocellulose, DEAE cellulose, 
and SH Sepharose among other methods known to those 
skilled in the art. Alternatively, the gene- for 
this protease can be cloned by those skilled in the art. 
The cloned protease gene can be used either in vivo . 
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or, alternatively, the gene can be overexpressed in 
a suitable host, the overexpressed ubiquit in- 
specific protease purified and used for the same or 
similar purposes in vitro . The discovery of this 
enzyme, and detailed characterization of its sub- 
strate specificity herein provides for the in vitro 
"and in vivo use of this enzyme. 

The use of ubiquitin-protein fusions to allow 
the generation of a desired amino acid residue at - 
the amino-terminus of a protein of interest can be 
extended to facilitate the purification of such 
proteins from producer cells. A gene can be readily 
constructed that encodes a convenient marker pro- 
tein, such as streptavidin, linked to a ubiquitin- 
protein fusion construct described above. The 
resulting (marker protein) -ubiquitin-protein fusion 
can be simply isolated from producer cells by using 
the preselected property of the marker protein, for 
instance, the known ability of streptavidin to be 
isolatable by affinity chromatography on a biotin 
column. Thus, purified (marker protein) -ubiquitin- 
protein fusion can then be specifically cleaved by 
the ubiquitin-specific protease described in this 
invention to generate the final product, a protein 
of interest with the desired amino acid residue at 
its amino-terminus . 

The codon for the amino-terminal amino acid of 
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desired amino acid by, for example, site-directed 
mutagenesis technicpies currently standard in the 
field. If the gene encoding the protein of interest 
is a synthetic gene the appropriate 5' codon can be 
built-in during the synthetic process. Alter- 
natively, nucleotides for a specific codon can be 
added to the 5' end of an isolated or synthesized 
gene by ligation of an appropriate DNA seG[uence to 
the 5" (amino-terminus encoding) end of the gene. 

Ubiquitin-like fusion partners capable of being 
cleaved by the ubiquitin-specific protease can also 
be used. In addition, fusion partners other then 
xibiqiiitin for masking the amino-terminus of a 
protein of interest can be used. In appropriate 
cases, the fusion proteins can be designed to 
contain a proteolytic cleavage site for a restric- 
tion endopro tease which has. sufficiently narrow 
specificity so that only one target site is cleaved 
in a fusion protein. A crucial property of such a 
protease must be a sufficiently releuced requirement 
for the nature of the cuaino acid residue (s) cODUtting 
the carboxy-terminal side of the cleavage site. The 
target site for cleavage is the junction between the 
fusion partner emd the amino-terminus of the protein 
of interest and thus the recognition site for the 
endoprbtease is located to provide for cleavage at 
this location. The commerically available protease, 
complement factor X^, exhibits these properties and 
- thus can be used to directly generate proteins with 
predetermined amino acid residues in the ultimate 



position of their amino-termini (see, K. Nogai and 
H.C. Thogersen Nature 309 :810 (1984)). The recog- 
nition site for the endoprotease can be engineered 
into the jtinction between the masking protein 
sequence and the 3' region encoding the amino- 
terminus of the protein of interest. 

A different and distinct method for engineering 
short-lived proteins is provided in this invention 
by the discovery that ubiquitin-protein fusions, 
such as ubiquitin-Pro-^-galactosidase fusion (TeO^le 
1) , that cannot be efficiently deubiquitinated are 
metabolically unstable. Thus, by attaching the 
amino-terminal ubiquitin moiety to a protein in a 
way that makes its removal either impossible or 
inefficient, one can destabilize a protein by a 
distinct technique which is qualitatively different 
from the method of generating the desired 
amino-terminus of a protein according to the 
requirements of the N-end rule. Prevention of the 
efficient deubiquitination of a ubiquitin-protein 
fusion can be achieved in several ways, for in- 
stance, by using a proline residue at the 
ubiquitin-protein junction as shown in Table 1, or 
by changing the amino acid sequence of ubiquitin 
near its carboxyl -terminus in such as way that the 
ubiquitin moiety is no longer recognized by the 
ubiquitin-specific processing protease but can still 
be recognized by the rest of the degradative path- 
way. These and other ways to reduce the rate of 
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deubiquitination of a ubiquitin-proteln fusion can 
be readily ascertained by those skilled in the art. 

The methods of this invention can be employed, 
inter alia, for regulating the half-life of a 
protein intracelluiarly. There are many instances 
Where this capabUity is useful; For example, when 
a gene is introduced into a cell for expression 
therein, the expressed product can be designed for a 
long or short half-life depending upon the particu- 
lar need. 

In general, destabilized proteins which have 
short half-lives are more amenable to regulation of 
intracellular levels of the protein. The ability to 
finely regulate the intracellular levels and ac- 
tivity of a protein can be useful in therapy or in 
. the work with in vitro cell cultures, in gene 
therapy, for example, a gene may be introduced into 
a cell to compensate for a genetic deficiency or 
abnormality. The gene can be inserted under control 
of an inducible promoter. Induction results in 
enhanced e^^ression of the gene product and con- 
sequently, higher levels of the product within the 
cell. If the gene is designed to encode an unstable 
protein, the intracellular concentration of the 
expressed protein will be more quickly responsive to 
a later reduction in the rate of its synthesis 
because it does not persist within the cell, m 
this way, the intracellular level and/or activity of 
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the protein encoded by the inserted gene can be more 
finely regulated. 

The method of this invention can also be used 
to expand the uses of selectable markers by- 
shortening the time necessary for a phenotype 
related to the marker to become manifest. Toward 
this end^ a product encoded by a marker gene can be 
destabilized by altering its amino-terminus ac- 
cording to the N-end rule. In this way selection 
for the negative phenotype can be facilitated 
because the product of the marker gene will be more 
quickly extinguished after fxinction of the gene 
encoding the marker is abolished. An example is the 
thymidine kinase (tk) gene The tk gene can be 
engineered to encode a less stable enzyme by intro- 
ducing an appropriate destabilizing amino acid at 
the amino-terminus. Gene mutation resulting in tk" 
phenotype will be more quickly manifested by cells 
because residual tk'will be more quickly degraded. 
This can be especially useful in slow growing cells 
where more time is required to "dilute out" tk 
synthesized prior to transformation to the tk" type. 

The* principles of protein modification based 
upon the N-end rule may also be employed in the 
design of cytotoxins. Proteinaceous cytotoxins can 
be designed as xinstable proteins degradable by the 
N-end rule pathway so that they do not persist after 
their toxic action has been exerted on a target 
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cell. Reducing, the lifetime of the toxin reduces 
the likelihood of killing hontargeted cells. 

Discovery of the N-end rule pathway of degra- 
dation allows development of mutant cells having 
mutations in genes encoding essential components of 
the N-end rule pathway. For example, cells can be 
produced that either permanently or conditionally 
are unable to efficiently degrade otherwise short- 
lived proteins. These cells can be used to produce 
desired proteins that ordinarily would be unstable 
within a cell. 

The invention is illustrated further by the 
following detailed description of the elucidation of 
the N-end rule. 

Methods 

Protein Secruenelnep 

S- cerevisiae cells carrying pUBi23 (Fig. l) , 
which encodes ub-Met-^gal (Fig. 3A) , were labeled 
with [ S] methionine, followed by extract pre- 
paration, immunoprecipitation of ^gal and elect- 
rophoresis as described below. The wet polyacryla- 
mide gel was subjected to autoradiography, the band 
of ^gal was excised, and the electroeluted ^gal was 
subjected to six cycles of radiochemical sequencing 
by Edman degradation. The sequencing was carried 
out by w. Lane at the MicroChem Facility of Harvard 
University.. 
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Site-dlrected Mutagenesis 

pUB23 (figure 1) was treated sequentially with 
Acc I, the Klenow fragment of pol I, and Bam HI. A 
fragment containing the Xho I site was purified and 
inserted between a filled-in Hind III site and a BAM 
HI site of the M13mp9 phage DNA. (J. Messing and J. 
Vieira, Gene 19, 263 (1982) )• Site-directed muta- 
genesis (M. Smith, Annu. Rev . Genet . 19, 423 (1985)) 
was carried out as described by Kramer, W. et al. 
Nucl. Acids Res , 12, 9441 (1984) using a synthetic 
25-residue oligodeoxyribonucleotide, containing ten 
bases on the 5* side and twelve bases on the 3' side 
of the Met codon of ^gal. All four bases were 
allowed to. occur at the original Met codon positions 
during synthesis. Primary phage plagues were 
screened by hybridization (Wood, N.I. et al. PNAS 
82, 1585 (1985)), with the use of a 12-residue 
oligonucleotide probe spanning the region of codon 
changes and hybridizing to the original sequence. 
Nonhybridizing plaques containing inserts of the 
expected size were sequenced by the chain termina- 
tion method. (Sanger, F. et al. , PNAS 71 5463 
(1977)). To transfer -the desired constructs into 
the pUB23 background, replicative form DNA of mutant 
phages was digested with Xho I and Bam HI, and added 
to the same digest of the plasmid pLGSD5-ATG (see 
Fig. 1 and L. Guarente, Methods Enzymol . , 101 181 
(1983)). The ligated mixture was used to transform 
the E. coli strain MC1061. (M.J. Casadaban and S-N. 
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Cohen, J. Mol. Biol. , 138 179 (1980)). Colonies 
containing plasmids of interest (in which the open 
reading frame of ^gal had been restored) were 
recognized by their light blue color on X-gal 
plates. 

Pulse^'Chase Experiments 

S. cerevisiae cells of the strain BWG-9a-l (MM 
his4 ura3 adeS) , transformed (F. Sherman et al . 
Methods in Yeast Genetics Cold Spring Harbor Labo- 
ratory, N»Y. , 1981) ) with plasmids of interest were 
grown at 30°C to A^^^ of approximately 5 in a medium 
of 2 percent galactose, 0.67 percent Yeast Nitrogen 
Base without amino acids (DIFCO) , adenine (10 Mg/ml) 
and amino acids including methionine (Sherman, F. et 
al., supra) . Typically, cells, from a 5 ml culture 
were harvested by filtration through the well of a 
Millipore microtiter filtration plate, washed 
several times on the filter with the same medium 
lacking methionine and resuspended in 0.3 ml of 1 
percent galactose, 50 mM potassium phosphate buffer 
(pH 7.4). [^^S]methionine (50 to 100 jaCi) was then 
added for 5 minutes at 30^C; the cells were col- 
lected by filtration and resuspended on 0.4 ml of 
the growth medium containing cycloheximide at 0.5 
mg/ml. Samples (0.1 ml) were withdrawn at indicated 
times, and added to 0.75 ml of cold buffer A (see 
below for buffer composition) containing leupeptin, 
pepstatin A, antipain, aprotinin and chymostatin 



(Sigma), (each at 20Mg/iiil) in addition to 0.4 ml of 

glass beads. Immediately thereafter, the cells were 

disrupted by vortexing for approximately 3 minutes 

at 4^0; the extracts were centrifuged at 12,000g for 

3 minutes and the radioactivity of acid-insoluble 
35 

S in the supematants was determined. Aliquots of 

the supematants containing ec[ual amounts of the 

35 

total acid-insoluble S were processed for immuno- 
precipitation with a monoclonal antibody to gal. 
Ascitic fluid containing a molar excess of the 
cmtibody (at least tenfold) was added to each 
aliquot, with stibsequent incubation at 4*^C for 2 
hours; protein A-Sepharose (Pharmacia) was then 
added, the suspension was incubated with rocking at 
4°C for 30 minutes and centrifuged at 12,000g for 1 
minute. The protein A-Sepharose pellets were washed 
three times in buffer A (see below) containing 0.1 
percent sodium dodecyl sulfate (SDS) , resuspended in 
an SDS, dithiotreitol (DTT) -containing electro- 
phoretic sample buffer (U.K. Laemmli, Nature 227 680 
(1970)), heated at lOO^C for 3 minutes, and cen- 
trifuged at 12,000g for 1 minute. Equal aliquots of 
the supematants were subjected to electrophoresis 
in a 7 percent discontinuous polyacrylamide-SDS gel 
(15 by 15 by 0.15 cm) with subsequent flourography. 
In some experiments, the above protocol was not 
used, but the extracts were prepared by boiling 
cells directly in the presence of SDS, with es- 
sentially the same results. 
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Analysis of xib- ^cral proteins produced in E. Coli 

Plasmid ptJB23 (Figs, l and 3) was intxoduced 
into DS410^ a minicell-producing E. coli strain. 
(N. Stoker, et al. , in Transcription and Trans- 
lation; A practical Approach B. p. Hames and S. J. 
Higgins,. Eds., IKL press, Oxford, 1984, p. 153). 
Minicells were prepared and labeled for 60 minutes 
at 36^C witH C^^S]methionine ( 600 Ci/mmole, 
Amersham) as described by N. Stoker et al., supra . 

Labeled minicells were centrifuged, resuspended 
in 2 percent SDS, 10 DTT, 10 mM Na-HEPES (ph 7.5) 
and heated at lOO^C for 3 minutes. After centri- 
fugation at 12,000g for 1 minute the supernatant was 
diluted 20-fold with buffer A (1 percent Triton 
X-100, 0.15 M NdCl, 5 mM Na-^EDTA^ 50 mM Na-HEPES, ph 
7.5), followed by the addition of phenylmethyl- 
sulfonyl fluoride (PM5F) and N-ethylmaleimide to 0.5 
mM and lOmM, respectively. After 4 hoxirs at 4*^C, 
the sample was dialyzed against buffer A containing 
0.5 mM PMSF overnight at 4°C, and processed for 
immunoprecipitation (as described above) . 

Analysis of ub- j tfcral proteins produced in veast 

S. cerevisiae cells carrying plasmids of interest 
were grown in 800 ml of a uracil-deficient medium, 
then hairvested and disrupted with glass beads in 
buffer A containing leupeptin, pepstatin A, anti- 
pain, aprotinin and chymostatin (each at 3 /Cg/ml) . 
The extract was centrifuged at 12,000g for 3 
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minutes. Saturated ammonium sulfate was added to 
the supernatant to a final concentration of 57 
percent. After overnight incubation at 4°C, the 
precipitated protein was collected by centrifugation 
at 23,000g for 30 minutes. The pellet was redis- 
solved in buffer A containing protease inhibitors. 
After clarification at 12,000g for 3 minutes, the 
sample was passed through an affinity column which 
had been prepared by crosslinking an IgG fraction 
from an ascitic fluid (containing a monoclonal 
antibody to gal to Affi-Gel 10 (Bio-Rad) . The IgG 
fraction used for crosslinking had been purified 
from the ascitic fluid by affinity chromatography on 
protein A-Sepharose. After washing with buffer A 
lacking Triton X-100, the antibody-bound proteins 
were eluted with 0.25 M glycine-HCl (pH 2.6). The 
eluate was immediately adjusted to pH 7.5 with 1 M 
Na-HEPES (pH 8.5), and thereafter made 0.1 percent 
in SDS. The sample was concentrated by ultrafil- 
tration in Centricon 30 (Amicon) , and subjected to 
electrophoresis in a 7 percent discontinuous poly- 
acrylamide-SDS gel (U.K. Laemmli, Nature (London) 
227, 680 (1970)0 • Electroblotting of proteins to 
nitrocellulose, and immunoblot analysis with a 
peptide-mediated antibody to ubiquitin were per- 
formed as described by P.s. Swerdlow, D. Finley and 
A. Varshavsky, Analyt . Biochem . 156, 147 (1986). 
The same results were obtained with a different 
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antibody to ubiquitin obtained from A. Haas (Univ of 
Milwaukee Med". School) . 

Detailed Description of the Figures 

Figure 1 shows construction of a ubiquitin-lac2 
gene fusion. pUB2, a pBR322-based genomic DNA clone 
(E. Ozkaynak, et al. Nature 312 , 663 (1984) contains 
six repeats of the yeast ubiquitin-coding sequence 
(open boxes) together with the flanking regions 
(jagged lines) . pUB2 was modified as shown in the 
diagram by placing a Bam HI site six bases down- 
stream from the first ubiquitin repeat. This 
allowed the construction of am in-frame fusion 
(confirmed by nucleotide sequencing) between a 
single ubiquitin repeat and the lacZ gene of the 
expression vector pLGSD5-ATG (called G2 in L. 
Guarente, Methods Enzvmol . loi 181 (1983)). The 
term «2 /«.m«' denotes a region of the pLGSD-ATG that 
contains the replication origin and flanking se- 
quences of the yeast plasmid called 2 /*m circle (See 
L. Guarente, supra) . Figure 33 shows the amino acid 
sequence of the fusion protein in the vicinity of 
the ubiquitin-^al junction. 

Figure 2 shows that the in vivo half-life of • 
gal is a function of its amino-terminal residue, 
(lane a) Minicells isolated from an E. coli strain 
carrying pUB23, the initial ub -lacZ fusion (Figs, i 
and 3B), were labeled with C^^S] methionine for 60 
minutes at 36°C, with subsequent analysis of ^gal as 
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described. The same result was obtained when the 

labeled minicell SDS extract was combined with an 

xinlabeled yeast SDS extract before immunoprecipita- 

tion of gal, (lane b) S. cerevisiae cells carrying 

pUB23 (Fig. 1) , which encodes lab-Met-^gal (Fig. 3B) , 

were labeled with [^^S] methionine for 5 minutes at 

30*^C, with subsequent analysis of ^gal. The same 

result was obtained with the lengths of the 
35 

[ S] methionine labeling periods from 1 to 30 
minutes, and with yeast extracts produced either by 
mechanical disruption of cells in the presence of 
protease inhibitors or by boiling the cells directly 
in an SDS-containing buffer, (lane c) Same as lane 
a but with E. coli cells carrying the control 
plasmid pLGSD5 (called Gl in L. Guarente, supra .) 
which encodes gal. (lanes d to g) S. cerevisiae 
cells carrying pUB23 (Fig. 1) , which encodes ub-Met- 
^gal (Fig. 3A) , were ledDeled with [^^S] methionine 
for 5 minutes at 30°C (lane d) followed by a chase 
in the presence of cycloheximide for 10, 30, and 60 
minutes (lanes e to g) , extraction, immunoprecipita- 
tion, and analysis of ^al. (lanes h to j) Same as 
lanes d to f , but with ub-Ile-^gal (see Fig. 3 A) . 
(lanes k to m) Same as lanes h to j, but with 
xib-Gln-^gal . (lemes n to q) Same as lanes d to g, 
but with ub-Leu-^gal. (lanes r to u) Same as lanes 
d to g, but with ub-Arg-^gal. Designations: ori; 
origin of the separating gel; ub, ubiquitin; ^gal, 
an electrophoretic band of the ^gal protein 
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containing a specified amino-tenainal residue; in 
this terminology, the Met-^gal portion of ub-Met- 
^gal is designated as ^ gal. Arrowheads denote a 
met cibolically stable, about 90kD degradation product 
of ^gal which xs formed apparently as the result of 
an in vivo endoproteolytic cleavage of a proportion 
of short-lived ^gal proteins such as Leu-^gal and 
Arg-^gal (lanes n to u) • 

Figure 3 shows the changing amino acid residues 
of gal at the xibiquitin- gal junction. (A) The 
initial plasmid, pDB23 (Fig.l),. which encodes 
ub-Met-^gal, was mutagenized as described above to 
convert the original Met codon ATG at the ub-^gal 
junction into codons specifying 19 amino acids other 
than Met.- (The original round of mutagenesis shown 
in Figtire 3, produced 15 out of 19 possible sub- 
stitutions. The remaining four substitutions were 
produced later (see Table 1)). The arrowhead in- 
dicates the site of the deubiquitinating in vivo 
cleavage in the nascent fusion protein that occurs 
with all of the fusion proteins except ub-Pro- gal 
(see text) . All of the constructions shown encode 
His as the second gal residue. In addition, in 
some of the constructions (ub-Met-His-Gly-^gal, 
ub-Met-Gln-Oly-^gal, and iib-Met-Gln-His-Gly-^gal, 
the. last one produced by an insertion mutation; see 
Table 2) , either His. or Gin were following Met at 
the ubiquitin-^gal junction, with indistinguishable 
consequences for the metabolic stabilities of the 
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corresponding gal proteins. (B) The amino acid 
sequence (in single-letter abbreviations) of ub-Met- 
^gal, the initial fusion protein (Fig. 1), in the 
vicinity of the ub-^gal jxinction. Single-letter 
amino acid abbreviations: A, Ala; C, Cys; D, Asp; E, 
Glu; F, Phe; G, Gly; His; I, lie; K, Lys; Leu; 
M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; S, Ser; T, 
Thr; V, Val; Trp; Y, Tyr. 

Figure 4 shows that ubiquitin- gal is short- 
lived if not deubiquitinated. (lanes a to g) S. 
cerevisiae cells carrying plasmids encoding 
iib-X-^gal fusion proteins in which X is the residue 
indicated at the top of each lane, were labeled for 
5 minutes at 30^C with [^^S] methionine, followed by 
extraction, immunoprecipitation and analysis of 
pgal. Fluorographic exposures for these lanes were 
several times longer than those for similar patterns 
in Figure 2 to reveal the multiple ubiquitination of 
short-lived ^gal proteins. (lanes h, i) Fluoro- 
graphic overexposure of lanes n, o in Fig. 2 to 
reveal the "ladder" of multiply ubiquitinated 
Leu- ^ gal proteins in a pulse-chase experiment (zero 
and 10 minutes chase, respectively), (lane j) Same 
as lanes a to g, but with lab-Pro-^al. (lane k) 
Same as lane j, but with ub-Gln-^al. (lane 1) Same 
as lane j. (lanes m to p) s. cerevisiae cells 
carrying a plasmid encoding ub-Pro-^gal were labeled 
for 5 minutes at 30°C with [^^SJ methionine (lane m) 
followed by a chase in the presence of cycloheximide 
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for 10, 30, and 60 minutes (lanes n to p) . The 
upper small arrow to the right of lane p denotes 
ub-Pro-^gal, a small proportion of which is still 
present after 1 hour chase. The lower small arrow 
indicates an apparently deubiquitinated Pro-^gal 
that slowly accumulates during chase and is meta- 
bolically stable. The dot to the left of lane m 
denotes an endogeneous yeast protein that is pre- 
cipitated in some experiments by the antibody used. 
Square brackets denote the multiply vibiquitinated 
^-gal species (see Fig. 5). Other designation are 
as in Figure 2. 

Figure 5 shows the "ladder" gal species 
containing ubiquitin. (lane a) S. cerevisiae cells 
carrying a plasmid which encodes ub-Gln-^al, were 
grown and disrupted, and the extracts processed for 
isolation of ^gal proteins by affinity chromato- 
graphy on a column with immobilized antibody to 
^gal. The ^gal proteins thus obtained were 
electrophoresed in a polyacrylamide-SfDS gel, trans- 
ferred to nitro-celluiose, and probed with an 
antibody to ubiquitin. (lane b) Same as lane a, but 
with ub-Pro-;^gal. (lane c) Same as b but a longer 
autoradiographic exposure, (lane d) S. cerevisiae 
cells carrying a plasmid which encodes ub-Leu-flgal 
were labeled with [^^S J methionine for .5 minutes, 
with subsequent extraction, immunoprecipitation and 
electrophoresis • of ^gal (the same sample as in 
Figure 4, lane f) . Square brackets denote the 
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multiply xibiquitinated Gln-^al species detected 
with antibody to ubiquitin. The arrow indicates the 
band of ub-Pro-^al, the initial fusion protein seen 
in lanes b and c. The arrowheads indicate the 
position of the band of deubiquitinated ^gal (de- 
tectable by either Coomassie staining or metabolic 
labeling, but not with antibody to xibiguitin) 
derived from the ub-Gln-^gal fusion protein. 

Figure 6 shows both prokaryotic and eukaryotic 
long-lived intracellular proteins have stabilizing 
amino acid residues at their amino-termini , whereas 
secreted proteins exhibit a complementary bias. 

(A) 208 long-lived, directly sequenced, intra- 
cellular (noncompartmentalized) proteins with 
unblocked amino-termini from both prokaryotes (77 
proteins) and eukaryotes (131 proteins) were distri- 
buted into three groups according to the nature of 
their amino-terminal residues as defined by the 
' N-end rule (Table 1) . All of the long-lived intra- 
cellular proteins examined bear exclusively stabi- 
lizing residues at their amino-termini. In panels B 
to D, analogous diagrams are presented for 243 
secreted eukaryotic proteins (B) , for 37 light and 
heavy immunoglobulin chains (C) , and for 94 secreted 
eukaryotic toxins (D) . Entries in C and D are 
subsets of entries in B. For proteins in B to D, 
the amino-termini compiled correspond, whenever the 
assignment is possible, to the most processed form 
of a protein that is still located within a 
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secretihg cell, rtia data in A to D were manually 
compiled from the entire set of complete protein 
sequences available before 1981. The same con- 
clusions have been recently reached after a more 
detailed 'and extensive, computer-assisted tabulation 
of protein amino-termini using the current National 
Biomedical Research Foundation database. The 
amino-terminal residues of Asn, Cys, His, and Trp 
were excluded from the compilation because in vivo 
half-lives of the corresponding gal proteins are 
still uiiknown (sae, however, the legend to Table 1) . 
Inclusion of the residues (Table 1) into a recently 
compilation of the same type did not change the 
original conclusion* Although the amino-terminal 
Pro was also ex;cluded from the compilation. Pro 
appears to be a stabilizing residue for ^gal (TcQjle 
1) / consistent with the frequent presence of Pro at 
the amino-termini of long-lived noncompartmentalized 
proteins. 



Results and Discussion 

Rapid in vivo de ubiquitination of a nascent ubi- 
quitin- ^ gal fusion protein 

Branched ubiquitin conjugates in which the 
carboxyl-terminal glycine. of ubiquitin moieties is 
joined via an isopeptide bond to the ^-amino groups 
of internal lysine residues in proteins apparently 
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comprise the bulk of ubiquitin conjugates in eu- 
karyotic cells. Joining of xibiquitin to the amino- 
terminal cf -amino groups of target proteins, to yield 
linear ubiquitin conjugates, may also be chemically 
feasible. See A. Hershko, et al. , PNAS USA 81 ; 7021 
(1984) . Whether or not linear ubiquitin-protein 
fusions are actually synthesized in vivo through 
posttranslational enzymatic conjugation of ubiquitin 
to protein amino-termini, such proteins can also be 
produced by constructing appropriate chimeric genes 
and expressing them in vivo. Construction of one 
such gene, which encodes yeast ubiquitin linked to 
gal of Escherichia coli, is shown in Figure 1. 

When this gene is expressed in E. coli, the 
resulting ^gal-contaiiiing protein has an apparent 
molecular mass which is approximately 6 kD greater 
that that of the control ^gal, a value consistent 
with the presence of ubiquitin in the protein 
encoded by the chimeric gene. In contrast, when the 
same gene is expressed in yeast, the corresponding 
^gal protein is electrophoretically indistinguish- 
able from the control ^gal. This result is in- 
dependent of the length of the ["^^S] methionine 
labeling period (between 1 smd 30 minutes) . 
Furthermore, determination of the amino-terminal 
residue in the putative Met-^gal (half-life, t^^^ 20 
hours) by Edman degradation of the in vivo-labeled, 
gel -purified ^-gal (Figure 2, lane d) directly 
confirmed the presence of the expected Met residue 
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(Figure 3A and Table i) at its amlno-terminus. 
independent evidence that ubiquitin cleavage off the 
fusion protein occurs immediately after the last Gly 
resxdue Of ubiguitin is presented belov. We con- 
elude that in yeast, ubiquitin is efficiently 
Cleaved off the nascent ubiquitin- fusion protein, 
yielding a deubiquitinated ^gal. CThe absence of 
the deubiquitination reaction in E. coli is con- 
sxstent with other lines of evidencei;;iicating that 
prokaryotes lack both ubiquitin and ubiquitin- 
specific enzymes ] . 

Iha Ubiquitin-^ -gal junction encod«i by ta.a 
*«eric g„,, siy-Het (Figure. 1 and 3B), t. 
"a«leal to junctions adjacent repeats 

^ the poly„b.,nitin precursor protein, „Mch Is 

it r,Tf l"'" Ubiquitin. Thus 

it is LLkeiy ttat the sane protease, as yet un- 
^aracterU^ biocheMically, 1. responsible both tor 
th. =onv«rston oc poiyublquitia Into -ature ubi- 
qu.t^ and (or the deubiquitination ot the nascent 

to inhibit the m vivo deubiquitination ot the ubi- 
^itinT.g.1 (and thereby to allov analysis ot 
»^oUc consequences ot a stable ubiquitin attach- 
ment to^ -gal, would be to convert the Het residue 
ot fi.^ « th. ubiqultln-;gal junction (Figure 3B. 
into other a^ino acid rescues (Plgure 3A,^e ' 
unexpected results of simh 

belov. ^ approach are described 
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The in vivo half -life of gal ia a function of its 
amino-terminal residue. The ATG codon which speci- 
fies the original Met residue of gal at the ubi- 
quitin junction (Figure 3B) was converted by sited 
directed mutagenesis into codons specifying 19 other 
amino acids (See Figure 3 A and TedDle 1) . These 
constructions differ exclusively in the first codon 
of ^ gal at the ublquitin- junction (Figure 3A) . 
After each of the 16 plasmids thus designed was 
introduced into yeast, analysis of the corresponding 
gal proteins pulse-labeled in vivo led to the 
following results (Figures 2, 4, and Table 1): 

1) With one exception (see below) , the effi- 
cient deubiqultlnation of the nascent ubiquitin-^gal 
occurs irrespective of the nature of the amino acid 
residue of ^gal at the ublquitln-^gal junction. 
Thus, the apparently ubiquit in-specific protease 
that cleaves the original xibiquitin-^gal protein at 
the Gly-Met junction is generally insensitive to the 
nature of the first residue of yBgal at the junction 
(Figure 3A and Table 1) . This result, in effect, 
makes it possible to expose different amino acid 
residues at the amino-termlni of the otherwise 
identical ySgal proteins produced in vivo. 

2) The in vivo half-lives of the^ gal proteins 
thus designed vary from more than 20 hours to less 
than 3 minutes, depending on the nature of the amino 
acid residue exposed at the amino-termlnus of ySgal 
(Figures 2, 4, and Table 1). Specifically, 
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deubiquitinated /Jgal proteins with either Met, Ser, 
Ala, Thr, Val, Cys or Gly at the amino-terminus have 
relatively long in vivo half-lives of 20 hours or 
more (figure 2, lanes d to g, and Table 1) , similar 
tb the half-life of a control ^gal whose gene had 
not been fused to that of ubiquitin. In striking 
contrast, the ^ gal proteins with either Arg, Lys, . 
Phe, Leu, Asp or Trp at the amino-terminus have very 
short half-lives, between approjcimately 2 minutes 
for Arg- ^ gal and approximately 3 minutes for Lys- 
^gal, Phe- /3 gal. Leu- ^ gal, Asp-^gal, Asn-^gal 
and Trp- ^ gal (Figure 2, lanes n to u, and Table 1). 
The half-life of ^gai proteins with amino-terminal 
residues of either Gin, His or Tyr is approximately 
10 minutes (Figure 2, lanes k to m, and Table 1), ^ 
while an amino-terminal lie or Glu confers on ^gal a 
half -life of approximately 30 minutes (Figure 2, 
lanes h to j , and Table 1) . Both pulse-chase and 
continuous labeling techniques were used in these 
experiments and yielded similar results. 

The set of individual amino acids can be 
ordered with respect to the half-lives that they 
confer on gal when exposed at its amino-terminus. 
The resulting rule (Table l) is referred to as the 
"W-end rule". 
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Table 1; The N-end rule 



Residue X In 
ub-X-^gal 



Radius of 
gyration 
of X(A) 



In vivo 
deublqultination 
of nascent 
ub-X-^gal ^1/2 



of X 



-^gal 



Met 
Ser 
Ala 
Thr 
Val 
Gly 
Cys 



1.80 
1.08 
0.77 
1.24 
1.29 
0 



+ 
+ 
+ 
+ 
+ 



20 hours 



He 
Glu 



1.56 
1.77 



+ 
+ 



30 minutes 



Tyr 
Gin 
His 



2.13 
1.75 



10 minutes 



Phe 
Leu 
Trp 
Asp 
Asn 
Lys 



1.90 
1.54 

1.43 

2.08 



+ 
+ 
+ 
+ 
+ 
+ 



3 minutes 
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Arg 2.38 , + 2 minutes 

^^o 1.25 7 minutes 

*The rate of in vivo deubiquitination of xib-Pro-/gal 
is extremely low. The t- shown is that of the 
initial ub-Pro-lgal fustofl protein (see Fig. 4, 
lanes J to p) • 
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Leqend to Table 1 

The N-end rule. In vivo half-lives of ^gal 
proteins in the yeast cerevisiae were determined 
either by the pulse-chase technique (for short-lived 
gal's; see belov) or by measuring the enzymatic 
activity of gal in crude extracts. For the measure- 
ments of ^gal activity, cells growing in a galactose- 
containing meditam were transferred to an otherwise 
identical medium lacking galactose and containing 10 
percent glucose. After further growth for at least 
5 hours at 30 C, the ratio of ^gal activities per 
cell before and after shift to glucose was deter- 
mined for each of the ^gal proteins. [GAL promoter- 
driven expression of the fusion genes (Figs. 1 and 
3) is repressed in glucose medium]. For shorter- 
lived ^gal proteins (t. ,3 < 1 hour), the pulse- 
chase technique was used'^Ss well (Figs. 2 and 4) . 
EJ|ctrophoretic bands of ^gal proteins labeled with 
[ S] methionine in pulse-chase experiments were cut 
out from scintillant-impregnated-dried gels similar 
to those of Figs. 2 and 4, and "^^S in the bands was 
determined. The in vivo decay of short-lived gal 
proteins deviated from first-order kinetics in that 
the rate of degradation was lower when measured at 
later (1 hour) time points of the chase, , the lower 
rate reflecting either a time-dependent toxic effect 
of cycloheximide or intrinsic characteristics of the 
in vivo degradation process. [Arrest of translation 
is required for an efficient short-term chase in S. 
cerevisiae because of the amino acid pool equili- 
bration problems related to the presence of vacuoles 
In this organism]. The half-life values listed 
below were determined for the first 10 minutes of 
chase. Several lines of evidence (see description 
of Figs. 4 and 6) suggest that Pro is a stabilizing 
residue. The listed radii of gyration of amino 
acids are from. M. Levitt, J. Mol. Biol . 104:59 
(1976). " 
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Amino-terminal loc ation of an amino acid Is es- 
sential for its effect on cral half-life 

Site-directed muta:genes±s was employed to 
insert a codon specifying a "stabilizing" amino acid 
(in this experiment, the Met residue) before the 
first codon ot pqal at the ubicpiitin-^gal jxinction 
(Table 2) . Insertion of a stabilizing residue (Met) 
before either another stabilizing residue (Thr) or a 
variety of destabilizing residues (Gin, Lys, and 
Arg) at the ubiquitin-^gal junction invariably 
results in a long-lived deubiquitlnated ^gal (Table 
2) . Furthermore, in contrast to ubiquitin-Pro-^gal 
which is not only short-lived but also resistant to 
deubiquitination (Figure 4, lanes j to p, and Table 
1) , ubiquitin-Met-Pro-^gal is efficiently deubi- 
quitinated in vivo to yield a long-lived Met-Pro- 
^gal (Table 2). These results show that both the 
identity of amino acid residue and its amino- 
terminal location (presumably the presence of a free 
a-amino group) are essential for its effect on ^gal 
half life. In addition, these results (Table 2) 
further support the expectation that ubiquitin 
cleavage off the fusion protein occurs immediately 
after the last Gly residue of ubiquitin (Figure 3A) . 
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Table 2: N-termlnal location of an amino acid is 
essential for its effect on 5gal half-life * 


Fusion protein 


tj^ of deubiquitinated . 
fusion protein 


T 

y ub - Thr - /jgal 
ub I Met - Thr - ^gal 


>20 hours 
>20 hours 


T 

Y ub - Gin « dgal 
ub I Met . Gin - flgal 


«10 minutes 
>2G hours 


1 

- ub - Lys -^gal 
ub - Met - Lys - /9gal 


*3 minutes 
>20 hours 


T 

- ub - Arg -tfgal 
ub • Met - Arg - ^gal 


»2 minutes 
>20 hours 


^ ub - Pro - flgal 
ub - Met - Pro - ^'gal 


* 

a? minutes 
>20 hours 



Amlno-terninal location of an amino acid is essen* 
tial for its effect on ^gal half-life. The inser- 
tion mutants were obtained essentially as described 
for the initial set of mutants except that a 32- 
residue oligonucleotide, 5 '-CCCGGGATCCGTGC(G/C/ 
T/) (G/T)CATACCACCTCTTAG was used, containing 14 
bases on the 5» side and 15 bases on the 3» side of 
the eunbiguous codon inserted behind the Met codon. 
Bases in parentheses denote ambiguities at the 
positions 16 and 17 in the sequence. Half -lives of 
the corresponding ^gal proteins were determined as 
described in the legend to Table 1. 
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A long-lived cleavage product, of cyal Is formed 
daring decay of short-lived ^gal proteins. 

The electrophoretic patterns of short-lived 

(but not of long-lived) ^gal proteins invariably 
contain a specific, about 90 kD cleavage product of 

^gal (Figxire 2, lames n to u) which, unlike the 
parental ^gal specieis, accumulates during the 
postlabeling (chase) period (Figure 4, lanes m-p) - 
The 90 KD ^gcil fragment constitutes a relatively 
small proportion of the initial amount of the 
pulse-labeled ^gal. . Nonetheless, its existence 
LmplLBS that an in vivo endoproteolytic cleavage Ccui 
rescue a protein fragment from the metabolic fate of 
its short-lived parenteil protein. It remains to be 
seen whether the resulting possibility of multiple 
lialf*lives within a single protein species is 
exploited in the design of naturally short-lived 
proteins. 

Ubiquitin-^gal i s short-lived when not deubicnii- 
tinated, 

Ubiquitih-Pro-^gal, the only ubiquitin-^gal 
fusion that is not deubiqoitinated in vivo (Fig. 4, 
lanes j to p) , has a half-life of approximately 7 
minutes (Table 1) which is less than 1 percent of 
the half -life of metabolically stable ;3 gal proteins 
(Table 1) • One interpretation of this result is 
that a metabolically stable ubiquitin attachment to 
protein amino-termini is sufficient to signal 
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degradation of acceptor proteins. This inter- 
pretation is consistent with earlier biochemical and 
genetic evidence that ubiquitination of short-lived 
proteins in a mammalian cell is essential for their 
degradation. At the same time, all ubiquitin-^gal 
fusion proteins other than ubiquitin-Pro-^gal are 
rapidly detibiquitinated in vivo (Table 1) . Thus, 
the posttranslational amino-termal ubiquitination of 
proteins may not be involved in an initial recogni- 
tion or commitment step that designates proteins for 
degration in vivo. Whether posttranslational 
amino-terminal ubiquitination (if it actually occurs 
in vivo) is essential for later stages of the 
degradation pathway remains to be determined - 
Earlier in vitro experiments indicated that prefer- 
ential chemical modification of amino-termini of 
proteolytic substrates inhibits their degradation in 
an in vitro tibiquitin-dependent proteolytic system. 
Based on these data, it was proposed that amino-ter- 
minal ubiquitination of proteins is essential for 
their degradation. An alternative interpretation of 
the same results is that chemical blocking of 
proteins* amino-termini prevents the recognition of 
their amino-terminal residues by the "M-end rule" 
pathway whose initial stages are not necessarily 
ubiquit in-dependent . 
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Short-lived ^ggal pr oteins are multiply ublguitinated 
in vivo , . . 

Over exposures of the pulse-chase fluorograms 
(Fig. 2) reveal that the major band of a deubi- 
quitinated, short-liyed ^gal protein coexists with a 
"ladder" of larger molecular mass, ^gal-containing 
bands irregularly spaced at 4 to 7 )cD intervals 
(Fig. 4, lanes c to g) . Np such larger species 
appear when the fluorograms of long-lived^ gal 
proteins are similarly overexposed (Fig. 4, lanes a 
and b) . Xmmunological analysis with both antibodies 
to ^ gal and antibodies to ubiquitin demonstrates 
that the " ladder" ^gal species contain ubiquitin 
(Fig. 5) . 

A model for the s elective degradation pathway . 

With, the exception of natural or engineered 
ubiquitin fusion proteins (Fig. i and Table 1) 
nascent proteins apparently lack ubiquitin moieties. 
The in vivo amind-terminal processing of nascent 
noneompartmentalized proteinis generates their mature 
amino-termini via the action of amino-termihal 
peptidases whose substrate specificities have been 
partially characterized, (see Tsunasawa, S. et al. 
J. Biol. 260 5382 (1985) ; Boissel, J.P. et al. 

PNAS USA 82, 8448 (1985)). We suggest that the 
amino-termini thus generated are recognized by an 
"N-end-readirig" enzyme. One specific model is that 
a commitment to degrade a protein molecule is made 
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as a result of the recognition of its amino-terminal 
residue by a stochastically operating enzyme whose 
probability of "clamping" at the target's amino- 
tenninus is determined by the N-end rule. (Table 1) . 
Once the commitment is made, it is followed by a 
highly processive ubiquitination of the target 
protein which in the case of ^gal is conjugated to 
more than 15 ubiquitin moieties per molecule of gal 
(Fig. 4, lanes c to g, and Fig. S) . The multiply 
ubiquitinated target protein is then degraded by a 
"down stream" enzyme (1) for which the ubiquitin 
moieties of the target serve as either recognition 
signals or denaturation (unfolding) devices, or 
both. 

The ubiquitin-contalning "ladder" ^gal species 
(Fig. 4, lanes c to 1, and Fig. 5) consist of 
apparently branched ubiquitin moeities joined to the 
f -amino groups of internal lysine residues in /J gal. 
Surprisingly, the "ladder" ^gal species derived from 
ubiquitin-Pro-^gal are electrophoretically indistin- 
guishable from the analogous species of ^gal whose 
amino-terminal ubiquitin is cleaved off the nascent 
fusion protein (Fig. 4, lanes j to 1, and Fig. 5). 
If the electrophoretically indistinguishable ubiqui- 
tinated gal species are indeed structurally homo- 
logous, these results would be compatible with two 
alternative models in which, immediately after the 
first ubiquitins are branch-conjugated to ySgal, 
either a branch-ubiquitinated ubiquitin-Pro-^gal 
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undergoes amino-terminal deiibiquitination or, 
alternatively, and analogous ^gal species lacking 
the amino-terminal xibiquitin moiety reacquires it. 
Eicperimental resolution of this ambiguity may 
establish whether the posttranslational amino- 
terminal ubiquitination of proteins (if it occurs in 
vivo) plays a role in tie selective protein turn- 
over. 

'■ in*' 

Although both prokaryotic and eukaryotic 
proteins appear to follow the N-end rule (see 
below) , bacteria apparmtly lack the ubiquitin 
system. Thus it is possible that the hypothetical 
N-end-recognizing protein is more strongly conserved 
between prokaryotes and eukaryotes than is the rest 
of the selective degradation pathway. Interest- 
ingly, the properties of a mammaliam protein E3 
whose presence is required for ubiquitination of 
proteolytic sxibstrates by iibiquitin-conjugating 
enzymes in vitro are consistent with it being a 
component of the K-end-recognizing protein. 

The y-end rule an d the known amino-termini of 
intracellular proteins - 

The unblocked- amino-term'inal residues in 
metabolically stable, noncompartmentalized proteins 
from both prokaryotes and eukaryotes are exclusively 
(Fig. 6A) of the stabilizing class (Met, Ser, Ala, 
Gly, Thr, Val), that is, the class that confers long 
in vivo half-lives on gal (Table 1). The one 
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short-lived intracellular protein for which the 
mature amino-tenninus Is known is the cll protein of 
phage leunbda, the central component of a trigger 
that determines whether A grows lytically or 
lysogenizes an infected cell. (Y.s. Ho, D. Wulff, 
M. Rosenberg, in Regulation of Gene Expression , I. 
Booth and C. Higgins, Eds. (Ccunbridge Univ. Press, 
London, 1986), p. 79; F. Banuett, M.A^ Hoyt, L. 
McFarlane, H. Echols, I. Herskowitz, J. Mol. Biol> 
187, 213 (1986); M.A. Hoyt> D.M. Knight, A. Das, 
H.I. Miller, H. Echols, Cell 31, 565 (1982) ; K. 
Nasmyth, Nature (london) 320, 670 (1983)). The 
half-life of oil in lambda- infected E. coli is less 
than 3 minutes. Strikingly, the mature amino- 
terminus of cll starts with Arg (Ho, Y.W. et al . , J. 
Biol. Chem . 257 , 9128 (1982)), the most destablizing 
residue in the N-end rule (Table 1) . 

While the destabilizing amino acids can be 
either hydrophobic, \incharged hydrophilic or charg- 
ed, they share the property of having larger radii 
of gyration than any of the stedDilizing amino acids 
except Met (Table 1) . 

Amino-terminal residues in compartmentalized pro- 
teins are larcrely of the destabilizing class . 

Figxire 6 illustrates a striking difference 
between the choice of amino-terminal residues in 
long-lived, noncompartmentalized intracellular 
proteins (A) and in compartmentalized proteins, such 
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as secreted proteins (B) , many of which are also 
long-lived in their respective extracellular com- 
partments. One in^lication of this finding is that 
a single intracellular degradation pathway operating 
according to the H-end rule could be responsible 
both for the diversity of in vivo half-lives of 
intracellular proteins and for the selective de- 
struction of compartmentalized proteins that are 
aberrantly introduced into the intracellular space. 
Some miscompau^entalized proteins may be more 
harmful to the cell than others* it is therefore of 
interest that secreted eukaryotio toxins contain 
strongly destabilizing residues (Arg, Lys, Leu, Phe, 
Asp) at their amino-termini more often than the 

general population of . secreted proteins (Pig. 6, 
panels B. to D) .. . 

The above consideration also suggest that, if 
the topological outside of a cell, such as lumens of 
the endoplasmic reticulum and Golgi> and the extra- 
cellular, space, were to have degradation pathways 
analogous ta the N-end rule pathway, they could be 
based on "inverted" versions of the N-end rule in 
which the amino-terminal residues that are de- 
stabilizing inside the cell are now the stabilizing 
ones and vice versa. Thus, the methods of the 
present invention should also be useful for manipu- 
lating the metabolic stability and; other properties 
of compartmentalized proteins, including secreted 



ones. 
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Possible r ole of the N-end rule- pathway In the 
turnover of long-lived proteins . 

Long-lived intracellular proteins with destabilizing 
(Table 1) penultimate residues generally retain 
their initial amino-terminal methionine residue. 
The amino-terminal residues in long-lived intracel- 
lular proteins that do undergo amino-terminal 
processing are invariably of the stabilizing class 
(Table 1) . An interesting possibility that would 
involve the N-end rule pathway in the turnover of 
long-lived proteins is that the rate-limiting step 
in the in vivo degradation of long-lived proteins 
may be a slow aminopeptidase cleavage that exposes a 
destabilizing residue, followed by rapid degradation 
via the N-end rule pathway. Note that fine-tuning 
of the rate of degradation may in this case be a 
function of the rate of aminopeptidase cleavage 
exposing a destabilizing residue rather than a 
function of the residue's destabilizing capacity 
according to the N-end rule. 

The N-end rule a nd selective degradation of short- 
lived and damacred proteins . 

The recognition of polypeptide chain folding 
patterns or of .local chemical features that target 
an otherwise long-lived but damaged protein for 
selective degradation in vivo is unlikely to be 
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mediated directly by the N-end rule pathway. 
Instead, we suggest that specific proteases Cana- 
logous in function to nucleases that recognize 
specific lesions in DNA) cleave a targeted protein 
so as to expose a destabilizing residue at the 
amino-terminus of one of the two products of a cut. 
One testable prediction of this model is that the 
initial cleavage products of the degradation pathway 
should bear destabilizing residues at their N-ter- 
mini. The preferential exposure of destabilizing 
residues at the aiaino-^:eniiini of products of the 
initial protein cleavageis may be due either to 
intrinsic specificities of the proteases involved or 
simply to the fact that a majority of "the amino 
acids belong to the destabilizing cleiss (Table 1). 
Furthermore, initieO. cleavages of a protein would be 
expected to destcibilize aspects of its original 
conformation, thus increasing the probability of 
further internal cuts. Whether the initial cleavage 
products of a protein woiild be degraded exclusively 
via the N-end rule pathway or vould have to be 
processed further by additional internal cleavages 
should depend on several factors, such as the 
exposure of -destabilizing residues at the amino- 
termini of initial cleavage products, and the 
relative rates of introduction of internal cuts, in 
this model, the N-end rule pathway should be es- 
sential for degradation of most of the metabolically 
unstable proteinis, from chemically damaged. 
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prematurely terminated, improperly folded and 
miscompartmentalized ones to those that cannot 
assemble into native multisubunit aggregates, and 



short-lived in vivo. Thus, the metabolic in- 
stability of a protein may be mediated not only by 
the exposure of a destabilizing residue at its 
amino-terminus, but also by local conformational and 
chemical features of its polypeptide chain that 
result in proteolytic cleavages exposing destabiliz- 
ing residues at the suniho-termini of cleavage 
products . 

For any given protein, a variety of factors in * 
addition to the N-end rule may combine to modulate 
its half-life in vivo. Among such factors may be 
the flexibility and accessibility of the protein's 
amino-terminus (Thornton, J.M. and Sibanda, B.L. , J. 
Mol. Bio . 167 443 (1983)), the presence of chemical- 
ly blocking amino-terminal groups such as the acetyl 
group, the distribution of ubiquitinatable lysine 
residues near the amino-teriainus , and other variabl- 
es, such as the structure of the carboxy-terminus . 
Since eunino-terminal regions of multisubunit pro- 
teins are commonly involved in the interfaces 
between subunits (Thornton, J.M. and Sibanda, B.L., 
J. Mol. Bio . 167 443 (1983)), quartemary structure 
of proteins is yet another parameter that is ex- 
pected to modulate the impact of the N-end rule 
pathway on protein half -lives in vivo. Finally as 



finally to otherwise normal proteins that are 
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suggested above, the N-end rule pathway may also be 
essential for the degradation of proteins whose 
initial recognition as targets for degradation is 
independent of the structures at their amino- 
termini. 

Functional signi ficance of piasttranslational addi- 
tion of amino aelda to a^^^ no-termini of proteins . 
It has been kndvn for many years that in both 
bacteria and eukaryotes there exists an unusual 
class of enzymes, aminoacyl-transfer RN&-protein 
transferases, which catalyze posttranslational 
conjugation of specific amino acids to the mature 
amino-termlni of acceptor proteins in vitro (R.L. 
Soffer, in Transfer R N&: Biological Aspects . D. Soli, 
J.N. Abelson, P.R. schtmmel, Eds. (Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY 1^80) , 
p493;C. Deutch, Methods Enzy mol . io6, 198(1984): A. 
Kaji, H. Kaji, g.O. Novelli, j. Biol. Chem . 240 . 
1185 (1965)). The' posttranslational addition of 
amino acids to proteins in vivo dramatically ac- 
celerates in a stressed or regenerating tissue, for 
example, after physical injury to axons of nerve 
cells (S. Shyne-Athwal, R.v. Riccio, G. Chakraborty, 
H.A. ingolia. Science 231, 603 (1986) ; N.A. Ingolia 
et al. , J. Neurosci 3, 2463 (1983) ) . The N-end rule 
provides an explanation for this phenomenon. Me 
suggest that selective changes in metabolic sta- 
bUity of otherwise undamaged, longlived proteins 



• 
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that may be required by a changed physiological 
state of the cell are brought about by post- 
translational addition of destabilizing amino acids 
to the amino-termini of target proteins in vivo. 
Strikingly, the known reactions of posttranslational 
addition of amino acids to proteins (R.L. Soffer, in 
Transfer RNA; Biological Aspects , D. Soil, J.N. 
Abelson, P.R. Schimmel, Eds. (Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY 1980), p493;C. 
Deutch, Methods Enzymol . 106 , 198(1984): A. Kaji, H. 
Kaji, G.D. Novelli, J. Biol. Chem . 240 , 1185 (1965); 
S. Shyne-Athwal, R.V. Riccio, G. Chakraborty, N.A. 
Ingolia, Science 231 . 603 (1986) ; N.A. Ingolia et 
al. , J, Neurosci 3, 2463 (1983)) involve largely 
those amino acids (Arg, Lys, Leu, Phe, and Tyr) that 
are destabilizing according to the N-end rule (Table 
1) . Physiological states in which addition of 
destabilizing amino acids to proteins could be 
expected to occur include entry to and exit from the 
cell cycle, responses to chemical or physical 
stress, and specific differentiation events, such as 
erythroid differentiation and spermatogenesis, in 
which a proportion of preexisting, otherwise long- 
lived intracellular proteins is selectively de- 
graded*. 

The in vitro degradation of some proteolytic 
substrates in a ubiquitin-dependent system from 
mammalian reticulocytes has recently been shown to 
depend on the presence of certain aminoacyl-tRNAs 
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(Ferber, s. and Crechanover, A., j. Biol . Chem. 261 
3128 C1986) ) . We suggest that this phenomenon also- 
reflects a requirement for posttranslational ad- 
dition of specific destabilizing amino acids to the 
amino-termini of proteolytic substrates. The 
initial proteolytic substrates in question have 
amino-terminal residues of Asp or Glu, both of which 
are destabilizing according to the N-end rule 
(Table 1) . This raises an interesting and testable 
possibility that certain amino-terminal residues in 
. proteins may not be directly destabilizing as such 
but only through their ability to be conjugated to 
other destabilizing residues. 

Equivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine experi- 
mentation, many equivalents to the specific embodi- 
ments of the invention described herein. Such 
equivalents are intended to be encompasled by the 
following claims. 
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A method of regulating the stability of an 
intracellular protein, comprising modifying the 
amino-terminus of a protein to provide for 
exposure at the amino-terminus of a desired 
amino acid of the stabilizing or destabilizing 
class according to the N-end rule of protein 
degradation. 

A method of destabilizing an intracellular 
protein, comprising modifying the protein to 
provide an amino-terminal amino acid of the de- 
stabilizing class. 

A method of Claim. 2, wherein an amino acid of 
the destcUbilizing class is selected from the 
group consisting of Isoleucine, Glutamic acid. 
Tyrosine, Glutamine, Phenylalamine, Leucine, 
Asparginine, Lysine or Arginine. 

A method of producing a protein which is more 
or less resistamt to degradation than its 
wild-type coxinterpart, comprising introducing 
into the amino-terminus of the wild-type 
protein a desired amino acid of the stabilizing 
or destabilizing class according to the N-end 
rule of protein degradation to produce a 
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modified protein having a staO^ilizing or 
destabilizing amino acid at its amino-terminus • 

5. A method of Claim 4, wherein the modified 
protein is produced as a fxision protein con- 
taining a masking protein joined to its amino- 
terminus^ the masking protein being cleavable 
at the junction with the aminq*terminus of the 
modified protein. 

6. A method of Claim 5, wherein the masking 
protein is xibiquitin. • 

7. A method of Claim 6, wherein ubiquitin-specif ic 
protease is used to cleave ubiquitin from the 

. fusion protein at the ubiquitin-protein junc- 
tion either in vivo or in vitro. 

8. A method of Claim 4, wherein the masking 
protein is joined to the amino-terminus of the 
modified protein through a protein sequence 
comprising a site for recognition by an endo- 
protease. 

9. A method of Claim 8, wherein the endoprotease 

is complement factor X . 

a 

■J 

10. A method of producing a metabolically unstable 
protein comprising producing the protein as a 
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fusion protein wherein xibic[uitin is joined to 
the amino-tenainus of the protein in a manner 
such that the fusion protein cannot be ef- 
ficiently deubiquitinated. 

11. A method, of Claim 10, wherein the fusion 
protein' comprises ubiquitin-proline-protein. 

12. A method of Claim 10, wherein the fusion 
protein contains ubiquitin having a modifica- 
tion in the amino acid sequence recognized by 
ubiquitin-specific processing protease so as to 
reduce the efficiency of deubiquitination of 
the fusion protein by the protease. 

13. A gene construct encoding a fusion protein, the 
gene construct comprising a DNA sequence 
encoding a madiJcing protein joined to a DNA 
sequence encoding a protein of interest^ having 
a predetermined amino acid of the stabilizing 
or destabilizing class at its amino-teriinus, 
the masking protein being cleavsUsle at the 
junction with the amino-terminus of the protein 
of interest such that cleavage results in 
exposure of the amino-terminus of the protein 
of interest. 



14. A method of Claim 13, wherein the masking 
protein is ubiguitin. 
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A method of producing a protein having a 
predetermined amino-terminal structure, com- 
prising expressing the protein as a fttsion 
protein wherein the amino-terminus of the 
protein is fused to a masking protein which is 
specifically cleayable in vivo or in vitro at 
the junction with the amino-terminal amino acid 
of the protein. 

16. A method of Claim 15, wherein the masking 
protein is cleavable by a endoprotease. 

17. A method of Claim 16, wherein the masking 
protein is ubiquitin and the cleaving endo- 
protease is ubiquitin-specific protease. 



a 



A method of producing a protein having a 
predetermined amino-terminal structure, com- 
prising: 

preparing a DNA construct encoding a 
fusion protein comprising: 

i) a structural gene encoding the 
protein, the gene having a sequence 
at its 5« end encoding the pre- 
determined amino-terminca structure 
of the protein; and 

ii) DNA encoding a masking protein linked 
to the 5* end of the structural gene 
encoding the protein, the masking 
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protein being specifically cleaveOjle 
in vivo or in vitro at. its junction 
with the amino-terminus of the 
protein; and 
b. expressing the DNA construct in a host 

cell to produce the fusion protein encoded 
by the DNA construct. 



19. A method of Claim 18, wherein the fusion 
protein is cleaved within the host cell to 
release the protein having the predetermined 
amino-terminal structure. 

20. A method of Claim 18, wherein the fusion 
protein is produced in a cell which lacks the 
capEQjility to cleave the masking protein and 
the fusion protein is cleaved in vitro to 
release the protein having the predetermined 
€unino-terminal structure. 

21. A method of Claim. 18, wherein the masking 
protein is xibiguitin. 

22. A DNA construct for producing a protein having 
a predetermined amino-terminal s1:ructure, 
comprising: 

a. a structural gene encoding the- protein, 
the gene having a sequence at its 5» end 
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encoding the predetermined amino-terminal * 
structure of. the protein; and 
b. DNA encoding a masking protein linlced to 

th& 5» end of the structural gene encoding 
the protein, the masking protein being 
specifically cleavable in vivo or in vitro 
at its jxinction with the amino-terminus of 
the protein. 

23. A method of Claim 22, wherein the masking 
protein is ubiquitin. 

24. Isolated iibiquitin-specific protease capable of 
cleaving ubiquitin from the ami no-terminus of a- 
protein^ to which it is joined at the junction 
between ubiquitin and the amino-terminus of the 
protein. 

25. An isolated DNA sequence encoding ubiquitin- 
specific protease, of Claim 24. 



A mutant cell line which is conditionally or 
unconditionally incapable of synthesizing one 
or more proteases involved in the N-end rule 
degradative pathway. 
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